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Microwave Kinetic Inductance Detectors (MKIDs) are superconducting detectors capable of counting single 
photons and measuring their energy in the UV, optical, and near-IR. MKIDs feature intrinsic frequency 
domain multiplexing (FDM) at microwave frequencies, allowing the construction and readout of large arrays. 
Due to the microwave FDM, MKIDs do not require the complex cryogenic multiplexing electronics used 
for similar detectors, such as Transition Edge Sensors (TESs), but instead transfer this complexity to room 
temperature electronics where they present a formidable signal processing challenge. In this paper we describe 
the first successful effort to build a readout for a photon counting optical/near-IR astronomical instrument, 
the ARray Camera for Optical to Near- infrared Spectrophotometry (ARGONS). This readout is based on open 
source hardware developed by the Collaboration for Astronomy Signal Processing and Electronics Research 
(CASPER). Designed principally for radio telescope backends, it is flexible enough to be used for a variety of 
signal processing applications. 
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I. INTRODUCTION 

ARCONS, the ARray Camera for Optical to Near- 
infrared Spectrophotometry, was successfully commis- 
sioned at the Coude focus of the Palomar 200 inch tele- 
scope on July 28, 2011. ARCONS is based on a 32x32 ar- 
ray of Microwave Kinetic Inductance Detectors (MKIDs) , 
which are photon-counting detectors that have an in- 
trinsic energy resolution R^20-150, enabling them to 
perform low-resolution spcctroimaging without filters or 
dispersive optics^. ARCONS was designed to be sensi- 
tive in the 0.4-1.1 fim wavelength range and can time- 
tag photons to 1 (IS without read noise or dark current. 
ARCONS is the first astronomical instrument based on 
optical/near-IR MKIDs, and is the largest non-dispersive 
optical/near-IR spectrophotometer fielded by a factor of 
10. More details on the instrument design and goals can 
be found in Mazin et al^. 

MKIDs work on the principle that incident pho- 
tons change the surface impedance of a superconductor 
through the kinetic inductance effeciP. The kinetic in- 
ductance effect occurs because energy can be stored in 
the supercurrent of a superconductor. Reversing the di- 
rection of the supercurrent requires extracting the kinetic 
energy stored in the supercurrent, which yields an extra 
inductance. This change can be accurately measured by 
placing this superconducting inductor in a lithographed 
resonator. A microwave probe signal is tuned to the res- 
onant frequency of the resonator, and any photons which 
are absorbed in the inductor will imprint their signature 
as changes in phase and amplitude of the probe signal. 
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Since the measured quality factor of the resonators is high 
and their microwave transmission off resonance is nearly 
perfect, multiplexing can be accomplished by tuning each 
pixel to a different resonant frequency with lithography 
during device fabrication. A comb of probe signals (one 
frequency for each resonator) can be sent into the detec- 
tor array, and room temperature electronics can recover 
the changes in amplitude and phase without significant 
crosstalk^, as shown in Figure [l] 

Low temperature operation places serious constraints 
on any cryogenic detector. The obvious advantage of 
MKIDs over competing cryogenic technologies like Tran- 
sition Edge Sensors^ (TESs) is the elimination of the 
cryogenic electronics required for multiplexing* ^. For 
ARCONS, only two microwave feed lines are used to 
read out 1024 MKIDs operating at 100 mK. However, a 
trade-off exists between the MKID's cryogenic simplicity 
and the complexity of the room temperature electron- 
ics. In particular, the signal processing task of decipher- 
ing the complicated sum of all resonator probe signals is 
not unlike the challenge presented to modern communi- 
cation engineers when developing multi-channel cell base 
transceiver stations. In this paper, we discuss in detail 
the MKID readout developed for ARCONS. Although 
various aspects are unique to ARCONS, the readout is 
broadly applicable to any other MKIDs orMSQUIEPi ap- 
plication, such as submillimeter cameragSHSl gr dark mat- 
ter detectorJii], Our approach differs from previous FDM 
MKID readouts-^-"^^ in that it uses a powerful and flexi- 
ble channelization technique (Section |lIIB | better suited 
to the high speeds needed for near-IR/Optical/X-ray 
photon detection instead of a long, single stage Fourier 
Transform to discriminate between readout tones. This 
readout also and covers significantly more bandwidth 
than previous efforts. 
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FIG. 1. The basic operation of an MKID, republished from 
Day et al]r^. (a) Photons with energy hv are absorbed in a su- 
perconducting film, producing a number of excitations, called 
quasiparticles. (b) To sensitively measure these quasiparti- 
cles, the film is placed in a high frequency planar resonant 
circuit. The change in the surface impedance of the film fol- 
lowing a photon absorption event pushes the resonance to 
lower frequency and changes the amplitude of the transmis- 
sion through the circuit. The dotted line in (c) shows the 
change of the transmitted microwave signal due to an inci- 
dent photon. To monitor the resonant circuit, it is continu- 
ously excited with a microwave signal, (d) The phase of the 
transmitted microwave signal depends on the proximity of the 
continuous microwave signal frequency and the variable res- 
onant circuit frequency. The energy of an absorbed photon 
can be determined by measuring the degree of phase shift of 
the transmitted microwave signal. 



II. HARDWARE 



Architecture Computing Hardware (ROACH) boarcP^. 
The core of the ROACH board is a Xilinx Virtex 5 
SX95T field programmable gate array (FPGA) used for 
demanding signal processing operations. In addition the 
ROACH board accommodates external hardware to gen- 
erate and digitize analog signals required to read out 
MKIDs. However, the MKID resonant frequencies are 
above the range accessible to common digital-to-analog 
(DAC) and analog-to-digital (ADC) converters. There- 
fore, an intermediate step must be taken to mix the 
"baseband" signals, which can be processed by the ADC 
and DAC, to the higher frequencies at which the the 
MKIDs resonate. To do this, we employ the quadra- 
ture amplitude modulation strategy generally employed 
in the digital communications field: A local oscillator 
(LO) is used to generate a continuous microwave tone 
at frequency, /loj near the resonators' frequencies. A 
baseband signal is precomputed and output from a two- 
channel DAC. For each resonator, the baseband signal 
is composed of two waveforms: a sine wave at the reso- 
nant frequency minus Jlo (Ibb), and the same waveform 
shifted -90 degrees out-of-phase (Qbb)- This complex 
baseband signal then multiplied by the the complex LO 
to produce a probe signal at the desired MKID frequency, 
lBBC0s{2n ft)+Q BBsin{2n ft) . After passing through the 
device and amplifiers the probe signal is mixed back down 
to the baseband with the same LO where it can be dig- 
itized with a two-channel ADC. Using this method with 
two-channel ADCs and DACs allows us to double the 
bandwidth available to a single DAC, and probe MKIDs 
above and below the Jlo- 

After digitization, the signals are passed to the FPGA 
since the data rate is far too high to store for offline 
processing. Once in the FPGA, the essential task is 
to break down the broadband signal emerging from the 
MKIDs, parsing it into the narrowband frequency bins 
constituting each channel in a process known as "chan- 
nelizing." We are using a two stage channelization core 
that uses a polyphase filter bank (PFB)iSl followed by a 
time-multiplexed direct digital downconverter to allow us 
to readout 256 resonators in 512 MHz of bandwidth, as 
discussed in Section IIIIBI A schematic for the readout 
hardware is shown in Figure [2j 



A. Overall Approach 

In order to read out an array of MKIDs, one must 
generate probe signals at the resonant frequency of each 
MKID. The sum of these waveforms, or "frequency 
comb," is sent through the device, where each detector 
imprints a record of its illumination on its correspond- 
ing probe signal. The frequency comb is then amplified 
with a cryogenic HEMT and brought outside the cryo- 
stat, where it is digitized and the phase and amplitude 
modulation of each individual probe signal is recorded 
with room temperature electronics. 

The MKID readout we developed to perform these op- 
erations is based on the CASPER's Reconfigurable Open 



B. Design Requirements 

The requirements for a MKID readout are relatively 
straightforward. 

• The readout must not introduce significant noise 
above the system noise floor set by the cryogenic 
HEMT amplifier with a noise temperature of ~6 
K. 

• For ARGONS, the entire readout system must be 
capable of reading out 1024 resonators in ^2 GHz 
of bandwidth. 




0.5-4.5GHZ Synthesizer 



Baseband Clock Synthesizer 



FIG. 2. The block diagram of the SDR readout for ARGONS. 



• The output bandwidth of the readout must be wide 
enough to faithfully reconstruct the desired data. 

• Crosstalk between channels greater than 250 kHz 
apart should be less than 1%. 

Conventional electronic phase noise is measured with 
respect to the origin of the IQ plane. Although it is 
often convenient to characterize a MKID's phase noise 
with respect to the center of the resonator's IQ loop, 
we maintain the more general convention here. All men- 
tions of phase noise are calculated and measured with 
respect to the origin of the IQ plane. The double side- 
band (DSB) phase noise of the HEMT is given by the 
simple expression, ksTn/P, where r„ is noise tempera- 
ture of the amplifier, and P is the power on the input. For 
ARCONS, the readout power for each MKID is roughly 
— 100 ± 15 dBm. For -85 dBm readout power, off res- 
onance, and T„ = 6 K, we expect -106 dBc/Hz for the 
HEMT phase noise. Ideally, the readout electronics will 
contribute noise well below this value. 

As the readout power is increased, the phase noise 
of the HEMT decreases placing stricter requirements on 
the readout electronics. Given an ADC's voltage noise 
specification, the number of channels n an ADC can 
read out at a given power without adding to the HEMT 
noise can be calculated. The results are shown in Fig- 
ure [3j The contours of Fig. |3] are calculated by scaling 
the HEMT noise with a channel-dependent factor giving, 
kBTn/{Prnax-P)^ ^^'^ assuming a 3 dB margin between the 
HEMT and the ADC noise. Pmax sets the full scale ampli- 
tude achievable by each tone when digitizing a multi-tone 
waveform. Generally, Pmax is minimized by assuming the 
phase of each tone is random resulting in Pmax ex: n^'^ 
for n >> 1. Specifically, we find that Pmax — 2.75 x n^/^ 
provides a reasonable estimate and was assumed when 
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FIG. 3. The required noise floor density in dBc/Hz of an 
ADC as a function of the number of resonators being read 
out and the readout power per resonator for an amplifier noise 
temperature of 6 K. With a 512 MSample/s sample rate, 10.6 
ENOB, we expect about -147 dBc/Hz for the ADG's phase 
noise. Even at the relatively high readout power -85 dBm this 
would allow for reading out over 1000 channels. Alternatively, 
256 channels could be read out with power as high as -70 dBm. 



calculating Figure [3J (See Section V. for further discus- 
sion.) 

The 1024 detectors in the ARCONS are contained in 
two arrays on a single wafer, with each array accessed 



by a single microwave feed line also fabricated on the 
surface of the wafer. Each array contains 512 resonators 
separated by approximately 2 MHz occupying roughly 
1 GHz bandwidth above 4 GHz. As discussed below, a 
single readout board can read out 256 resonators in up 
to 512 MHz of bandwidth, meaning that two readout 
boards, multiplexed with a power combiner and power 
splitter, are required for each feedline on the MKID array. 
A total of four boards is then required for the complete 
readout. 

Finally, we require a 500 kHz output sample rate for 
each resonator, which allows us to capture the pulse pro- 
file generated by the photon faithfully, yet minimizing 
crosstalk between resonators with close resonant frequen- 
cies. 



C. IF Board 

Since the MKID resonant frequencies are above the 
range accessible to the ADC and DAG, which can only 
record or generate Nyquist-limited signals < 256 MHz, 
we must mix the base band signals, with a microwave 
local oscillator near the resonator frequency range. The 
product of the LO and the two base band signals, called 
the intermediate frequency (IF), can then be used to 
probe MKIDs within ±256 MHz of the LO frequency. We 
have constructed a circuit board containing two Analog 
Devices ADF4350 wideband synthesizer, which can be 
programmed to generate an output frequency between 
137.5 MHz and 4.4 GHz. One synthesizer provides the 
LO for mixing and the other provides the 512 MHz logic 
clock for the ADG, DAG, and FPGA. In addition, this 
IF board contains additional room temperature hardware 
required for the readout: two IQ mixers for mixing to 
and from the IF; three IF wideband amplifiers; one base- 
band amplifier; three programmable attenuators; and a 
frequency doubler used to generate LO frequencies above 
4.4 GHz. The key figure of merit for the IQ mixers is the 
sideband suppression, which we measure to be -30 dBc. 
Physically, the IF board connects to the DAG and ADG 
boards via SMA connectors, and provides IF out and in 
SMA connectors which lead to the cryostat housing the 
MKIDs. 



D. ADC/DAC Boards 

The two pieces of hardware that place the most strin- 
gent limits on the readout electronics are the DAG for 
generating the probe signals, and the ADG for digitiz- 
ing the signal for signal processing. As discussed above, 
the signal to noise ratio (SNR), and sample rate are the 
crucial figures of merit. In general, ADGs underperform 
DAGs and therefore place the strictest limitations. We 
selected a 12-bit, 550 MSample/s ADG from Texas In- 
struments (ADS5463), which has a SNR of approximately 
64 dB and an effective number of bits (ENOB) of 10.6 for 




FIG. 4. One of four ROACH assemblies including the ADC, 
DAC, and IF boards. 



a full scale input across the bandwidth. The TI ADS5463 
has a noise fioor density -201og(2iO-S) - 101og(A^s/2) of 
-147 dBc/Hz, where Ng is the ADG sample rate. Ac- 
cording to Figure [3J this allows reading out over 1000 
channels per board, even at the relatively high readout 
power of -85 dBm. Alternatively, 256 channels could be 
read out with power as high as -70 dBm before ADG 
noise becomes comparable to the HEMT noise. 

For the DAG, we selected the 16-bit 1 GSample/s DAG 
(DAG5681) from Texas Instruments, which has a SNR 
of roughly 75 dB and noise floor of -159 dBc/Hz; again, 
well below the SNR requirements imposed by the HEMT, 
shown in Figure [31 

The baseband signals are generated by two DAGs, one 
for each I and Q, on a custom board shared with the 
ADGs. After the IF signal is passed through the MKIDs 
and mixed down to the baseband, it is recorded by two 
ADGs. The ADG/DAG boards mate directly with the 
peripheral ports of the ROAGH board, and the SMA 




III. FIRMWARE 



A. Comb Generation 



FIG. 5. Custom crate to house up to eight ROACH assem- 
bhes. The SMA connectors to each IF board are visible on 
the front panel. 



connectors on the IF board. The DAC board also routes 
the logic clock generated on the IF board to the FPGA, 
divided by a factor of two. The ADC board has an ad- 
ditional SMA input for routing a synchronization signal 
via the IF board to the FPGA. This was used to route a 
one pulse-per-second (PPS) timing signal from an exter- 
nal GPS. For ARGONS, the DAG and ADG were clocked 
at 512 MHz, and the FPGA at 256 MHz. The SNR of 
both ADG and DAG boards are within 3 to 5 dB of the 
specifications in the data sheets. 



E. Complete Readout 

We have constructed a custom crate to house the four 
ROAGH boards required to readout the entire MKID ar- 
ray for a compact installation at the observatory. It can 
be upgraded to house eight boards. The crate is pictured 
in Figure [5J Within the crate are four separate assem- 
blies containing a ROAGH, ADG/DAG, and IF boards, 
as pictured in Figure |4J 

There are two additional components that make the 
readout complete, and are essential to the accurate 
estimation of the photon arrival times: a Meinberg 
(GPS170PEX) GPS clock mounted within the data ac- 
quisition PG, and a Stanford Research (FS725) rubidium 
10 MHz frequency standard. The GPS generates a 1 PPS 
signal to which the rubidium frequency standard is syn- 
chronized, and the 10 MHz signal is used by both the IF 
and clock synthesizers. With these, we are able to record 
the time with better than 1 fis resolution and maintain 
synchronization between the four ROAGH boards. 



The most computationally intensive signal processing 
tasks are performed on the FPGA as firmware, including 
the generation of the input signal comb. Rather than di- 
rectly generating a waveform for each resonant frequency 
on the FPGA, a waveform defined as the sum of all res- 
onant frequencies is loaded into a portion of the DRAM 
memory. Each memory location contains data accessed 
sequentially as a look-up table (LUT) and output to the 
DAG. The FPGA is clocked at 256 MHz, while the ADG 
and DAG are clocked at 512 MHz. Therefore, in order 
to generate the waveform output for each DAG channel 
/ and Q, two consecutive 16 bit time samples must be 
presented to the DAG at each FPGA clock cycle. 



B. Channellzer 

The remainder of the FPGA resources are dedicated 
to processing the data digitized by the ADG. The first 
step and most mathematically involved process is the 
channelization. By design, the resonant frequencies of 
the MKIDs are evenly spaced about 2 MHz apart. The 
designed quality factor of the MKIDs and physics of the 
superconducting material sets the channelizer bandwidth 
for each resonator at 250 kHz. Variations in the fabri- 
cation are unavoidable, however, and the actual spacing 
of the resonant frequencies can vary considerably. This 
constraint means the channelizer must be able to accom- 
modate 500 kHz wide (double side band) channels ar- 
bitrarily spaced in the 512 MHz bandwidth available to 
each ROAGH board. Digital down conversion (DDG) is 
the most straight-forward and common solution to this 
task. However, the hardware demands of this scheme are 
high. We estimate that the number of channels processed 
per FPGA can be a factor of at least 10 higher with the 
more sophisticated, albeit complicated, scheme shown in 
Figure |6] 

We perform the channelization in two stages. The 
first is a coarse channelization resulting in wide, evenly 
spaced, overlapping bins with frequency widths equal to 
twice the frequency spacing. Figure uT a) shows an exam- 
ple of the overlapping bins output from the first stage of 
channelization. This coarse channelization is done with 
a twice over-sampled Fast Fourier Transform (FFT) pre- 
ceded by a finite impulse response filter (FIR) . The com- 
bined effect is to yield a Polyphase Filter Bank (PFB)P^. 
The time multiplexed frequency bins of the L = 512 sam- 
ples FFT are output from the first stage. With the FPGA 
clock running at 256 MHz, two samples per frequency bin 
per clock cycle are output from the PFB. Since a FFT 
has a sine-like frequency response, the time stream is first 
conditioned with the FIRp^s . This FIRpfb is a FIR fil- 
ter with a set of coefficients drawn from a sine function. 
Ideally, this convolution with a sine function in the time 
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FIG. 6. Schematic of channelizer firmware. 
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FIG. 7. (a) Three sample frequency bins output from the 1st stage of channelization with the PFB. Each bin is 2 MHz wide, 
with ~1 MHz overlap with adjacent bins, (b) The irregularly spaced, narrow frequency bins output from the 2nd stage. The 
bin width is 500 kHz wide and centered on the resonant frequency for a single pixel. 



domain results in rectangular frequencies output from the 
FFTs. More taps in the FIRp^^ gives more rectangular 
frequency bin widths, however, this also consumes signif- 
icant FPGA resources that could be used in later stages 
of the firmware. In order to produce the oversampled 2 
MSample/s/channcl output, two copies of the digitized 
time stream from the ADC are sent to two FIRpi?BS and 
FFTs to be processed in parallel, with one copy delayed 
by L/2. In addition, the odd frequencies output from 
the non-delayed block are multiplied by —1 in order to 
maintain consistent phases of the output between the two 
FFTs. The output of these two FFTs are then treated as 
consecutive time samples. See the appendix for further 
mathematical details on the PFB. 

The second stage is a fine channelization done with the 
output of the first stage with DDC. The 256 channels are 
presented at the output of the first stage in a time mul- 
tiplexed fashion with a sample rate of 2 MSample/s (2 
samples/FPGA clock cycle). Unless the probe signal is 



exactly on a frequency bin center of the first stage, the 
output will beat with a frequency that is the difference 
between the probe signal for a particular resonator and 
the nearest bin center frequency from the first stage. This 
is also the frequency with which the signal is digitally 
down-converted. Rather than using the conventional di- 
rect digital synthesis of this signal, we pre-calculate it 
as a LUT and store it in the ROACH board's DRAM. 
This is efficient and convenient since the DAC LUT and 
the DDC LUT each use 64 bits of the 128 bit DRAM-to- 
FPGA data bus width. The DDC LUT from the DRAM 
is presented in a time multiplexed fashion to match the 
channels' output from the first stage. 

The complex signal (/, Q) emerging from the mixing 
stage must now be filtered with a low-pass filter. As 
stated above, we choose a 20-tap, Hamming filter with 
a 250 kHz cutoff frequency. In addition, this output is 
down-sampled by a factor of two. Figure UVo) shows an 
example of the fine channelization. The output from the 



DDC could be downsampled an additional factor of two 
at this point, however for ease of implementation this was 
not done. 



C. Phase Conversion and Matched Filtering 

After the channelization, the phase of the complex sig- 
nals is measured, again in a time multiplexed manner. 
The MKIDs in ARGONS have little information in their 
amplitude response, (J^ + Q^)^/'^, so it is ignored^. Given 
/ and Q, the phase (/) = tan~^(Q//) is calculated effi- 
ciently on the FPGA using a GORDIG algorithm. 

The phase time series is then filtered with a pro- 
grammable FIR with 26 taps, which gives access to 26 jjls 
of data for the filter and allows for pulse arrival time de- 
termination to 1/is. For inputs a;„, the FIR is defined 
conventionally for outputs y„ as 
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where M — 25, and ft.,„ are the FIR taps. This pro- 
grammable filter can be loaded with arbitrary coeffi- 
cients giving wide flexibility. For single photon detec- 
tion, we find that a simple 250 kHz low-pass filter, or a 
matched (Weiner) filtep^^ provide the best results. For 
the matched filter, the coefficients are defined using a 
template for the expected pulse shape. In particular, if 
the normalized pulse template has a profile Pm, the co- 
efficients in the above equation are h„i — P(M-m)- This 
is similar to the technique of pulse compression used in 
radar and sonar technology^^. Like pulse compression, 
it allows for an accurate estimation of the pulse arrival 
time. In addition, one can make use of the noise auto- 
correlation function for a given channel when defining its 
Pm- When this is done, this filter is often referred to as 
an "optimal" linear filter. 



D. Event Triggering and Photon Packet Output 

After the data passes the matched filter, it must be 
analyzed for photon events. A significant variation of 
the phase, 0, from the mean indicates the absorption 
of a photon. A threshold must be determined for each 
channel to demarcate incident photons and background 
electronic noise. This is done by measuring the mean 
and standard deviation of each channel while dark, i.e., 
not illuminated by the sky. A small memory is allocated 
on the ROAGH board to accumulate a time series of 2^^ 
samples for a given resonator. 100 snapshots of such data 
are offloaded to the control PG and used to determine 
the mean and standard deviation for each channel. Each 
threshold is then stored on the FPGA. 

The trigger logic is rather simple. If (f) exceeds the 
threshold, and (P(f)/dt^ < 0, the value for (J>q is recorded 
as the peak pulse height. Excessive high frequency noise 



can cause multiple triggers for a single event. To avoid 
this, we implement a 100 /iS deadtime for each channel, 
after a pulse has been triggered. In addition, the two 
values of the phase before and after 0o are recorded. 
Recording three values of (f> requires negligible additional 
bandwidth, but may be valuable for post-acquisition es- 
timation of the pulse height and arrival time. 

The arrival time of the photon is also recorded. A 
counter runs continuously on each FPGA at 256 MHz. 
This counter is reset by the rising edge of each 1 PPS 
signal from the GPS. Since the 256 channels are time 
multiplexed on the FPGA, we are able to resolve the 
arrival times of photons on each channel to 1 fis. 

The three values 0_i, 0o:0+i5 the pulse arrival time, 
and the channel number are then stored in a small mem- 
ory on the FPGA acting as a packet buffer for offloading 
the data to the control PG. 



IV. SOFTWARE 

Various software was written to set up the ROAGH 
boards, and for transmitting the acquired data to a 
control PG. The setup/control software was written in 
Python. In addition, a simple data server written in 
Python runs on each ROAGH board. This server checks 
the address pointer of the packet buffer and uploads the 
packets to the client, the control PG. At the control PG, 
the data is accumulated with a c-based client, and is writ- 
ten to disk for later analysis. Various "quick-look" tools 
were written to aid in the acquisition and help with the 
telescope pointing. 



V. SETUP 

A probe signal with large power maximizes the signal 
and helps quench noise generated by material defects in 
the resonatopi^. However, the inductance of an MKID 
is approximately linear only at low probe signal powers. 
As the power increases the increasing nonlinear induc- 
tance excessively distorts the resonance loop. We probe 
each resonator with the maximum power tolerated by the 
resonator before these effects appear. 

Due to device variations, this maximum power, Pmax 
is different for each resonator. Pmax generally varies by 
less than 10 dB between the highest and lowest mea- 
sured quality factor (Qm) resonators, which can easily 
be accounted for with the 16 bits of amplitude resolution 
available to the DAG. After Pmax is determined for each 
resonator, the relative amplitudes of the probe signals 
are set for the LUT determination. The output attenua- 
tors are then adjusted to set the overall attenuation such 
that the amplitude of the sum does not exceed the 16 bit 
range available to the DAG. 

The measured quality factor of the resonators deter- 
mines the frequency resolution required for the probe 
signals. We chose a frequency resolution of 7812.5 Hz 



and a DAC sample rate of 512 MSample/s. This means 
the LUT length must be 2^^ samples of 16 bits, for each 
/ and Q. Each probe signal, p*^'\ is defined with 



n(0 



^(l) 



= yl(')exp(27ri/(')n/512e6 + i6i(')), 

where n — [0, 1, . . . , 2^^ — 1], /^^'^ is the Z*'' resonant fre- 
quency, and 9^^> is a randomly selected phase. The ampli- 
tude A^''^ is scaled with respect to the resonator with the 

highest Pmax, for which A*^'' — 1. All pj are summed 
and the maximum amplitude of the resultant waveform, 
Pmaxi is determined. The summed waveform, p„, is then 
normalized by this maximum amplitude times 2^^ (the 
largest peak-to-peak amplitude of the DAC) to give 



Pn = 



Pv 



I 



and casting p„ as a 16 bit integer. Including the random 
phase 6''-''' maximizes the dynamic range of the DAC by 
minimizing Pmax- This is especially important as the 
number of resonators, N, increases since Pmax -^ N-^^^. 
For 256 resonators (the maximum number of channels 
per board), Pmax is typically about 50, which results in 
nearly —34 dB power per resonator with respect to the 
full scale output of a single tone. 

With the powers and frequencies set, the next step is to 
set the threshold separating a pulse due to a photon from 
noise for each resonator. The primary consideration for 
setting this level is the magnitude of the photon pulses 
we expect to reach our device from the sky through the 
layers of blocking infrared filters. A secondary consider- 
ation is the data rate limit that data can be pulled from 
the ROACH boards via the 1 Gb/s ethernet ports. For 
ARCONS, we chose between 3 and 4 standard deviations. 
Over the course of an observing night, the device tem- 
perature slowly drifts up, which requires a re-calibration 
of the pulse threshold approximately every 4 hours. 

The alignment between the ROACH boards and the 
data acquisition computer is provided by a 1 pulse-per- 
second signal provided by the GPS on the data acquisi- 
tion PC. 

The performance of the readout is characterized in 
Figure |8] Since the amplitude of the probe tones arc 
discarded, only a measurement of the phase noise is re- 
quired. In order to simplify the comparison with the de- 
sign requirements listed above, we show phase noise with 
respect to the origin of the IQ plane and away from the 
resonance where the transmission through the array is 
high. The red line of Fig. IsFa) shows the phase noise 
for a single channel with only one readout tone with 
the IF out and IF in connected. This is a measure of 
the noise introduced by the readout electronics indepen- 
dently of the MKID array, HEMT, and auxiliary room 
temperature amplifiers. At mid-range frequencies (100 
Hz - 50 kHz) the noise floor is near the expected value of 
the phase noise contributed by the ADC, -147 dBc/Hz. 



There is excess noise at low frequencies for which we of- 
fer no explanation. At high frequencies, there are spuri- 
ous tones nearly 40 dB above the noise floor. The blue 
line of Fig. IsFa) shows the phase noise for a single chan- 
nel through the array with only one readout tone. The 
green line shows the phase noise for the same channel 
while the full 256 channels are being readout. With the 
exception of the spurious tones above 7 kHz, the noise 
floor of the two are similar and indicate that the read- 
out noise is dominated by the HEMT noise expected to 
be -106 dBc/Hz, meeting the design requirements. The 
spurs appear in harmonics of the frequency resolution of 
the probe tones (7812.5 Hz). We therefore speculate that 
they arise in the IQ mixers as nonlinear mixing products. 
Finally, the ultimate test of our readout scheme is 
the pulse height estimation, and the energy resolution 
thereby derived. In addition to the phase noise, the pulse 
height estimation is also limited by the length, or num- 
ber of taps of the programmable stage filter. To better 
understand this constraint, we examine the phase signal 
in time and the response due to incident photons. Fig- 
ure IsFb) shows the responses to incident 254 nm photons 
in time using two different FIRs. The blue curve was 
taken with the 250 kHz low-pass filter and shows the sud- 
den rise and (roughly) exponential decay to equilibrium, 
with a lifetime of r « 50 /is. The green curve was taken 
with the second stage FIR programmed as a matched 
filter. These data show the symmetric response typical 
of a matched filter. It also exemplifies the limitations 
of our FPGA implementation of this filter. The filter 
is only able to convolve data over 26 /iS, which is nearly 
half of the pulse lifetime, r. Effective matched filtering 
requires that the filter convolution range extend at least 
T. The lack of symmetry in the matched filtered data in 
Figure [sFb) refiects this shortcoming. Future work will 
focus on minimizing the channelization logic in order to 
maximize the length of the programmable filter. 



VI. SUMMARY 

We have demonstrated an effective and scalable sys- 
tem for reading out large arrays of MKIDs using 
open source hardware and software. In addition, all 
firmware and software tools developed here are offered 
as open-source and downloadable the CASPER website, 
https://casper.berkeley.edu. 
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FIG. 8. (a) Phase noise for the readout under three conditions. The red hne is a loop-back test connecting the IF outputs 
and inputs, characterizing the noise contributed by the readout electronics. Aside from the spurious signals, the noise floor is 
close to the expected -147 dBc/Hz contributed by the ADC. The blue and green lines show the phase noise of a single channel 
through the array with one and 256 probe signals respectively. Again, the noise floor is close the the expected -106 dBc/Hz 
contributed by the HEMT amplifier, (b) Typical phase response of a single channel in time due to 254 nm incident photons 
for both a low-pass filter and a matched filter. 
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Appendix A: Polyphase Filterbank Calculations 

The core of the first stage is formed by a twice over- 
sampled discrete Fourier transform (DFT) of length N. 
The DFT is defined as 



n=0 



— nk 



y-n 



(Al) 



where k = [0, 1,...,A^ — 1]. By "twice 

over-sampled," we mean the DFT is per- 
formed on the overlapping input data sets 

[yo, • ■ ■ , VN-i], [yN/2, ■ ■ ■ ,2/3Af/2-l], [VN, ■■■, y2N-l], ■ • ■, 

such that each y„ is input twice to consecutive DFTs. It 
is instructive to examine consecutive outputs of a bin k 
with an arbitrary y„ = e^'^*'^ '^ . The first output of j/fc 
gives 

~(0) _ .„.(fc-fc')f^) sin(^(fc-fcO) 

^'^ -' sin{7r{k-k')/Ny ^^^> 

Subsequent outputs accumulate an additional phase 
forming a time series, y^. , e'^**'' y^. , . . . , e'^*™'^ yj^ , with 



10 



a sample rate reduced by a factor 2/N. In general, the in- 
put wave frequency k' will be non-integral and y^ will 
beat accordingly. For later convenience at the second 
stage, we introduce an additional factor (—1)™'^ — ^^rnkTr 
to equation |A2[ Now, if our desired resonator frequency 
is k' , we choose bin k ^ k' — f where — 1 < / < 1, for 
finer channelization in the second stage 



(m) 

Vk 



e-^rnfy(fi)^ 



(A3) 



Equation |A2| makes it is clear that the magnitude is 
strongly attenuated for frequencies / 7^ with the sine 
function that characterizes DFTs. In order to recover 
flat frequency response and minimize spectral leakage, 
the signal must be filtered (Eq. fl]) before performing the 
DFT. Ideally, we would choose the filter such that the 
sine function in equation |A2| is replaced with a function 
equal to 1 for k' within the A;*'' bin, and outside. The 
(inverse) Fourier transform of such a rectangular filter in 
frequency space is a sine function in the time domain. In 



particular, the coefficients for /i„ are drawn from a sine 
function with M points. 



1 sin [tt (4n/M - 1)] 
{An/M - 1)] 



Mi 



(A4) 



.qM 



where < n < M — 1. The parameter a allows for the 
bin-widths to be scaled arbitrarily, (a = 1 gives bin- 
widths equal to 1/N.) Since our objective is to perform 
a finer channelization on these data for the ultimate 500 
kHz narrow bandwidth, we must scale the bins to overlap 
by at least this amount. To give sufficient bin overlap, it 
was chosen as a = 2.5. 

Since the number of inputs to this filter (convolution) 
is generally large {M > N), the operation is done most 
efficiently on hardware using the "Overlap-add" method. 
In addition, because M is finite, the behavior of the sine 
filter can be improved with a window function, e.g., Ham- 
ming, Gaussian, etc. 



